Grid Data Management: Simulations of Lcg 2008
نویسندگان
چکیده
Simulations have been performed with the grid simulator OptorSim using the expected analysis patterns from the LHC experiments and a realistic model of the LCG at LHC startup, with thousands of user analysis jobs running at over a hundred grid sites. It is shown, first, that dynamic data replication plays a significant role in the overall analysis throughput in terms of optimising job throughput and reducing network usage; second, that simple file deletion algorithms such as LRU and LFU algorithms are as effective as economic models; third, that site policies which allow all experiments to share resources in a global Grid is more effective in terms of data access time and network usage; and lastly, that dynamic data management applied to user data access patterns where particular files are accessed more often (characterised by a Zipf power law function) lead to much improved performance compared to sequential access.
منابع مشابه
The Sam-grid / Lcg Interoperability System: a Bridge between Two Grids
The SAM-Grid system is an integrated data, job, and information management infrastructure. The SAM-Grid addresses the distributed computing needs of the experiments of RunII at Fermilab. The system typically relies on SAM-Grid services deployed at the remote facilities in order to manage computing resources. Such deployment requires special agreements with each resource provider and it is a lab...
متن کاملLCG Data Management: From EDG to EGEE
The Large Hadron Collider (LHC) at CERN, the European Organisation for Nuclear Research, will produce unprecedented volumes of data when it starts operation in 2007. To provide for its computational needs, the LHC Computing Grid (LCG) is being deployed as a worldwide computational grid service, providing the middleware upon which the physics analysis for the LHC will be carried out. Data manage...
متن کاملFile Management for HEP Data Grids
The next generation of high energy physics experiments, such as the Large Hadron Collider (LHC) at CERN, the European Organization for Nuclear Research, pose a challenge to current data handling methodologies, where data tends to be centralised in a single location. Data grids, including the LHC Computing Grid (LCG), are being developed to meet this challenge by unifying computing and storage r...
متن کاملDirac Infrastructure for Distributed Analysis
DIRAC is the LHCb Workload and Data Management system for Monte Carlo simulation, data processing and distributed user analysis. Using DIRAC, a variety of resources may be integrated, including individual PC’s, local batch systems and the LCG grid. We report here on the progress made in extending DIRAC for distributed user analysis on LCG. In this paper we describe the advances in the workload ...
متن کاملar X iv : c s . D C / 0 31 10 21 v 1 1 7 N ov 2 00 3 LCG - 1 Deployment and usage experience
LCG-1 is the second release of the software framework for the LHC Computing Grid project. In our work we describe the installation process, arising problems and their solutions, and configuration tuning details of the complete LCG-1 site, including all LCG elements required for the self-sufficient site. 1 Brief introduction to LCG-1 LHC Computing Grid (LCG) is one of the five CERN projects at t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006